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PROCESS FOR OBTAINING RECOMBINED NUCLEOTIDE SEQUEN 
IN VITRO, LIBRARIES OF SEQUENCES AND SEQUENCES T 
OBTAINED 

5 The present invention relates to a method of obtaining recombined 

nucleotide sequences in vitro. The invention is particularly aimed at 
generating and then selecting polynucleotide sequences which are liable to 
have one or several advantageous properties as compared to the 
corresponding properties of reference sequences and therefore capable of 

10 conferring an improved phenotype and/or of producing improved proteins. 

By reference sequence is understood a sequence having properties 
close to those being sought. 

15 Different techniques have been developed in order to favor the in vitro 

recombination between different polynucleotide sequences, among which can 
be more particularly cited DNA-shuffling (12) and StEP (14), both based on 
the use of PCR. 

20 DNA-Shuffling comprises two steps, the random fragmenting by 

DNAse I of polynucleotide sequences, and an amplification by PCR in which 
the fragments previously generated serve as primers. At each step of 
hybridization, the change of template provokes recombinations at the level of 
the regions having homologous sequences. A schematic representation of 

25 this method is provided in figure 1A . 

The step consists of mixing different polynucleotide sequences 
containing various mutations in the presence of a pair of primers. This 
mixture is subjected to a PCR type of reaction in which the hybridization and 
30 polymerization steps are grouped together in a single step of very short 
duration. These conditions permit hybridization of primers but reduce the 
duration hence the length of polymerization, in such a way that the fragments 
which are partially synthesized are randomly hybridized to the polynucleotide 
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sequences having different mutations, thus permitting the recombination. A 
schematic representation of this method is provided at figure 1B. 

In each of these two methods, the polymerization step is indispensable 
5 to the process of recombination. Thus, according to the polymerase chosen, 
this polymerization step can bring about undesired supplemental mutations. 
Moreover, the DNA-shuffling and the step procedures rest on the principle of 
the hybridization of a "mega-primer" (6) to a template, which likely leads to 
difficulties in implementation for polynucleotide sequences of size greater than 
10 1.5 kbp (11). Finally, these two techniques do not permit the control of the 
rate of recombination, since they are made randomly in the course of 
successive polymerization steps. 

The present invention solves the disadvantages described above by 
15 offering a simple method for the preparation of recombined polynucleotide 
sequences, permitting the generation of polynucleotide sequences capable of 
displaying advantageous properties as compared to the corresponding 
properties of the sequences of reference and therefore capable of conferring 
an improved phenotype and/or producing improved proteins. 



25 



30 



This goal is attained thanks to a in vitro process for obtaining 
recombined nucleotide sequences starting from a library of polynucleotide 
sequences, also designated hereinafter as the initial library, characterized in 
that it includes the following steps: 

a) fragmenting a library of double-stranded polynucleotide 
sequences, 

b) the denaturation of fragments possibly in the presence of one or 
several assembling templates, 

c) the hybridization of said fragments with one or several 
assembling templates if the template/templates is/are not 
present in step (b), 
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d) the ligation of said fragments in order to obtain recombined 
polynucleotide sequences, 

e) the selection of recombined nucleotide sequences having 
advantageous properties as compared to the corresponding 

5 properties of one or several reference sequences. 

The process of the invention can include at the output of step (d) and 
before step (e), the repeating of steps (b), (c) and (d) no longer with the 
fragments of step (a) but with the products of step (d). 

This embodiment is particularly useful in the case where, at the outlet 
of step (d) all the fragments are not ligated. In this case, the process of the 
invention moreover includes, at the end of step (d) and before step (e), one or 
several of the following reaction cycles: 

denaturation of the ligated and non-ligated fragments coming 
out of step (d), possibly in the presence of one or several 
assembling templates. 

hybridization of said fragments with one or several assembling 
templates if it (they) is(are) not present at the time of the 
denaturation, 
ligation of said fragments. 

These denaturation, hybridization and ligation reactions are equivalent 
in steps (b), (c) and (d) but carried out not with the fragments of step (a) but 
25 with the ligated and non-ligated fragments coming out of step (d). 

The process of the invention can moreover include one or several of 
the following steps: 

the separation of the recombined polynucleotide sequences 
30 from the assembling template or templates before step (e), 

the amplification of the double-stranded recombined 
polynucleotide sequences before step (e). 
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the cloning of the recombined polynucleotide sequences 
possibly after separation of the recombined strands from the 
template or templates and obtaining of the corresponding double 
strand before step (e). 
5 The ends of the fragments generated at step (a) are such that there 

can be adjacent hybridization of these ends to the assembling template or 
templates at step (c) and ligation of these fragments with each other at step 
(d). The polynucleotide sequences of the library on which the process of the 
invention is carried out must include zones of homology either between them 
io or with the assembling templates, so as to permit the generating of the ends 
of the fragments such as described above. 

SI One advantageous embodiment of the process of the invention 

i]\ consists in simultaneously carrying out steps (c) and (d) according to a 

?* ? 15 reaction called RLR for the English expression of "Recombining Ligation 

m Reaction." 

:t; 

Besides the advantages previously indicated, the process of the 
invention is notable in that it favors and accelerates the random recombination 
in vitro of polynucleotide sequences and these polynucleotide sequences can 
be genes. By gene is understood a DNA fragment or sequence associated 
with a biological function. A gene can be obtained in different manners, such 
as chemical synthesis, synthesis by polymerization or by extraction of said 
gene starting with a source of nucleic acids. 

The in vitro recombination of the polynucleotide sequences of the initial 
library by the process of the invention therefore permits the obtaining of a new 
library containing sequences having acquired one or several characteristics of 
the sequences of the previous library. The process of the invention therefore 
comprises an in vitro technique of evolution. 

The process of the invention comprises an alternative to recombinant 
PCR such as implemented in the techniques of DNA shuffling (12) or of StEP 
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(14), since it does not require the in vitro polymerization step in order to end 
up with the recombination. To the contrary, the key step of the process of the 
invention is the step (d) of ligation on an assembling template, which assures 
a very high degree of faithfulness in the course of the recombination events. 

5 

The process of the invention is notable in that it permits a considerable 
increase in the efficiency of the reassembling of the fragments to ligate. In 
effect, in the case of a sequence cut up into n fragments, there exists n n 
possibilities of reassociation of the fragments by using a classical process of 
10 ligation (without using a reassembling template which directs the ligation), 
among which a single form is of interest. In the case of the process of the 
invention, the ligation is directed by the assembling template, which permits to 
direct obtention of the single form of interest. 

15 The fragmenting of these polynucleotide sequences at step (a) can be 

done either in a controlled manner or in a random manner. 

In the case of a fragmenting carried out in a controlled manner, the 
fragmenting permits controlling with precision the degree of desired 

20 recombination and the position of the points of recombination. According to a 
preferred embodiment of the process of the invention, step (a) consists of 
subjecting the polynucleotide sequences of the library to hydrolysis by the 
action of one or several restriction enzymes. In this way, in one particular 
embodiment of the process of the invention, the degree of recombination and 

25 the position of the points of recombination of the recombined polynucleotide 
sequences are determined by the fragmenting of step (a). 

Thus, the larger the number of fragments generated by the sequence, 
the higher the number of fragments necessary to recompose a sequence, 
30 which leads to a higher rate of recombination. Moreover, the nature and the 
position of the ends of the fragments generated in this embodiment of the 
process of the invention can be known and controlled, which permits: 
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controlling with precision the zones where the recombination 
takes place, or 

inducing recombination between polynucleotide sequences, for 
example genes, if the ends of the fragments are created in 
zones of homology between these sequences, or in zones of 
homology between these sequences and the assembling 
template or templates. 

In the case of a random fragmenting, any enzymatic or mechanical 
means known to a person skilled in the art allowing a random cutting of the 
double-stranded DNA can be used, such as for example DNAse I digestion or 
sonication. 

The process of the invention, which permits a considerable increase in 
the efficiency of reassembling fragments to be ligated, can therefore be 
applied to the directing of blunt end multi-molecular ligation. In this 
application, single or double-stranded oligonucleotides complementary only to 
the 3' end of a fragment and 5' of the adjacent fragment are used as the 
assembling template in steps (b) or (c), which permits adjacent hybridization 
of these two ends on the same template after the denaturation step. Once 
hybridized, the ends of the fragments can be ligated so as to direct the 
direction of the ligation of the blunt ended fragments . The same approach 
can be contemplated for the positioning of the ligation of fragments having 
sticky ends. 

An especially preferred embodiment of the process of the invention 
consists of adding at step (c) and/or at step (d) enzymes capable of 
recognizing and of cutting in a specific manner the non-hybridized ends of 
fragments, when these overlap with other hybridized fragments on the same 
template. A preferred example of this type of enzyme is the Flap 
endonuclease enzyme (10). 
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A particular embodiment of the process of the invention therefore 
consists of using enzymes of the Flap endonuclease type when the fragments 
generated at step (a) can be overlapping during the hybridization on the 
assembling template (of) at step (c). 

5 

In this way, during the hybridization of single stranded DNA fragments 
on a template, these enzymes have the property of recognizing and of cutting, 
in a specific manner, the non-hybridized ends of these fragments, when they 
overlap with other hybridized fragments on the same template. In the course 
10 of the hybridization step (c), these enzymes therefore permit increasing the 
number of adjacent ends that can be ligated at step (d), which is particularly 
important in the case of fragments obtained by random cutting, as these 
fragments have zones overlapping with each other when they are hybridized 
on the assembling template. 

15 

In a particular embodiment of the process of the invention using a 
ligase active at high temperature and preferably thermostable at step (d), the 
endonucleases, capable of recognizing and of cutting in a specific manner the 
non-hybridized ends of the fragments such as the Flaps, added at step (c) 
20 and/or at step (d) will have the same properties of thermoresistance and of 
high temperature activity as said ligase. 

The library of polynucleotide sequences on which the process of the 
present invention is carried out can be generated by any method known to a 

25 person skilled in the art, for example starting with a wild type gene, by 
successive steps of directed mutagenesis, by "error prone" PCR (2), by 
random chemical mutagenesis, by random in vivo mutagenesis, or by 
combining genes of close or distinct families within the same species or 
different species so as to obtain a variety of polynucleotide sequences in said 

30 library. 

Among these techniques, the invention more particularly contemplates 
a process in which the library of double-stranded polynucleotide sequences is 
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obtained by a polymerization chain reaction carried out under conditions that 
permit creation of random point mutations. 

The initial library of double-stranded polynucleotide sequences can be 
composed of synthetic sequences which will be fragmented at step (a) or 
which can be the fragments of step (a). 

According to a preferred embodiment for carrying out the process of 
the invention, step (a) consists of subjecting the polynucleotide sequences of 
the library to hydrolysis by the action of one or several restriction enzymes. 



In order to increase the degree of recombination generated by the 
process of the invention, it is sufficient to increase the number of restriction 
fragments by using restriction enzymes having a large number of cutting sites 
15 on the polynucleotide sequences of the library, or by combining several 
restriction enzymes. In the case of using a thermostable and thermoactive 
ligase, the size of the smallest fragment thus generated will advantageously 
be greater or equal to 40 bp, in order to retain a hybridization temperature 
compatible with that of the ligation step (d) which is generally of the order of 

20 65°C. 

Step (a) can also be carried out by generating a library of fragments by 
random enzymatic or mechanical treatment. In particular, step (a) can consist 
of a random treatment with DNAse I of a library of partially heterologous 

25 double-stranded polynucleotide sequences. In the case where a random 
enzymatic or mechanical fragmenting is used in step (a), this embodiment of 
the process of the invention has the characteristic of permitting the use of 
fragments generated by this treatment as templates for each other, for the 
hybridization in the course of step (c) or of the RLR reaction of the steps (c) 

30 and (d) simultaneously. 
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Step (b) can be carried out by combining at least two distinct libraries 
of fragments separately generated at step (a) starting from the same initial 
library by different treatments, as for example with different restriction 
enzymes. In the case of the embodiment of such libraries, the fragments 
5 obtained at step (a) are used as templates for each other, for the hybridization 
during the course of step (c) or the RLR reaction of steps (c) and (d) 
simultaneously. 

The fragments of step (a) of the process of the invention can be equally 

10 generated by amplification reactions such as PCR realized on the 
polynucleotide sequences of the library. Two solutions are contemplated. In 
a first case, the oligonucleotide primers can be conceived in a manner so as 
to generate fragments having ends which are adjacent all along the length of 
the assembling sequence. In a second case, the oligonucleotide primers are 

15 conceived in a fashion so as to generate fragments having common 
sequences, these fragments being capable of serving as an assembling 
template for each other at step (b) or at step (c). The process of the invention 
permits the combining in a random manner of different fragments obtained at 
step (a) and of reassembling them during steps (b), (c) and (d) within a 

20 polynucloetide sequence. This process therefore reproduces in vitro the 
recombination phenomena which can occur in vivo by favoring them. The 
process of the invention is therefore particularly of interest above all for 
recombining polynucleotide sequences among themselves in order to 
generate new polynucleotide sequences having interesting properties as 

25 compared to the corresponding properties of the reference sequences. 

The effectiveness of the recombination of the process of the invention 
depends on the number of fragments generated by a polynucleotide 
sequence at step (a). Consequently, the process of the invention will use 
30 polynucleotide sequences having been fragmented into n fragments, n 
advantageously being greater or equal to three. 
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The assembling template of step (b) or (c) is for example a 
polynucleotide sequence coming from the initial library or a consensus 
sequence of said library, single or double-stranded. In the case where the 
assembling template is incorporated directly at step (c) of the invention, this 
template must be single-stranded. 

According to a variant of the process of the invention, the assembling 
templates of steps (b) or (c) are composed of single or double-stranded 
oligonucleotides. 

According to a particular embodiment of the process of the invention, 
oligonucleotides, single or double-stranded, of variable length, are added at 
step (b) or (c) in addition to the template. These oligonucleotides are 
designed (to be capable of being) so as to be substituted for a portion of the 
fragments of step (c), in effect, their sequence is such that: 

if they are perfectly homologous with the sequence of the 
fragment which they are replacing, they favor certain 
combinations, or 

if they are partially heterologous with the sequence of the 
fragment that they are replacing, they introduce one or a more 
directed supplemental mutations. 

Before step (e) of the process of the invention, it is possible to separate 
the recombined polynucleotide sequences from the assembling template 
thanks to a labeling present in the assembling template or in the recombined 
polynucleotide sequences. It is in effect possible to label each strand of the 
template according to techniques known to a person skilled in the art. For 
example, the label of the assembling template can be a hapten and the 
recombined polynucleotide sequences templateare separated from the 
assembling template by techniques known to a person skilled in the art, such 
as for example an anti-hapten antibody boundon a support or a biotin- 
streptavidine reaction, if the hapten is a biotin label. 
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Other techniques can be employed in order to separate the 
recombined polynucleotide sequences from the assembling template. The 
assembling template can also be prepared specifically in a way so as to 
facilitate its elimination at the end of the process of the invention. It can thus 
be synthesized by PCR amplification using methylated dATP, which permits 
its degradation by the restriction endonuclease Dpn I. In this case, the 
recombined polynucleotide sequences must not contain methylated dATP. 
The template can also have been prepared by PCR amplification by using 
dUTP, which permits its degradation by treatment with a uracyl-DNA- 
glycosylase. Conversely, it is possible to protect the recombined 
polynucleotide sequences by amplifying them by selective PCR with 
oligonucleotides having 5' phosphorothioate groups. A treatment with an 
exonuclease then permits the specific digestionof the assembling template. 

The process of the invention can include before the possible cloning of 
step (e), a step of amplification of the recombined polynucleotide sequences. 
Any amplification technique is acceptable, notably a PCR amplification. One 
of the most simple consists of carrying out a PCR which permits specific 
amplification of the recombined polynucleotide sequences owing to primers 
which can only be hybridized at the ends of the recombined sequences. The 
PCR products are then cloned in order to be characterized and the 
polynucleotide sequences having advantageous properties as compared to 
the corresponding properties of the reference sequences are selected. 

The invention has for its object the generation of polynucleotide 
sequences liable to have advantageous properties as compared to the 
corresponding properties of reference sequences. The recombined 
polynucleotide sequences obtained at step (d) and possibly cloned are 
screened by any appropriate means in order to select the recombined 
polynucleotide sequences or the clones having advantageous properties as 
compared to the corresponding properties of the reference sequences. By 
advantageous property is understood to be, for example, the thermostability of 
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an enzyme or its abilityto function under conditions of pH or of temperature or 
of saline concentration more adapted to an enzymatic process than the 
control proteins usually used for said process. Forexample, such a process 
can be an industrial process to breakdown textile fibers or bleaching paper 
pulps or producing flavors in the dairy industry, the processes of biocatalysis 
for the synthesis by an enzymatic pathway of new therapeutic molecules, etc. 

According to an advantageous embodiment of the process of the 
invention, the polynucleotide sequence library can therefore be the result of a 
screening having permitted selection by any appropriate means of 
polynucleotide sequences having advantageous properties as compared to 
control sequences. The sequences selected in this way comprise a limited 

library. 

But, it is also possible to start from a non-limited library in order to 
preserve the representation of the properties contained in that library. 

The sequences coding for the protein or proteins having one or more 
advantageous properties as compared to the reference proteins are thus 
selected, by in vivo or in vitro screenings, and can be used to form a new 
library for a possible repeating of the process of the invention. One 
advantageous embodiment of the process of the invention therefore consists 
of using as the library several polynucleotide sequences selected after a first 
implementation of the process of the invention, possibly mixed with other 
polynucleotide sequences. Among the screening techniques which can be 
applied to each of the clones of step (e), the screening techniques in vitro give 
rise to the advantage of being free of problems of cellular physiology, and of 
any drawbacks tied to the in vivo expression cloning. Moreover, this type of 
screening is easily automated, which permits screening a higher number of 
recombined polynucleotide sequences. 

The invention also relates to a recombined polynucleotide sequence 
obtained by a process according to the invention, as well as to a vector 
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containing such a recombined polynucleotide sequence, a cellular host 
transformed by a recombined polynucleotide sequence or by a vector of the 
invention, as well as a protein coded by this recombined polynucleotide 
sequence. The invention includes as well the corresponding libraries of 
recombined polynucleotide sequences, vectors, cellular hosts, or proteins. 

Other advantages and features of the invention will appear from the 
examples of carrying out the invention, which follow and which are referred to 
in the attached drawings in which: 

Figure 1 is a schematic representation of the processes of the prior art 
corresponding respectively to DNA-shuffling (figure 1A) and to StEP (figure 
1B). 

Figure 2 is a schematic representation of an example of carrying out 
the process of the invention and of certain of its variations and applications. 

Figure 3 represents the positions of the ten zones of mutations {Pvu II 
and Pst I) carried by each mutant of the ponB gene used for the examples of 
the implementation of the invention. 

Figure 4 represents the position of the primers used as compared to 
the sequence of the ponB gene. 

Figure 5 represents the migration on agarose gel of RLR and of PCR 
reaction products of these RLR reactions. 

Figure 6 represents the position of the mutations as compared to the 
restriction fragments. 



I- EXAMPLE 

The process of the invention was put into practice starting from a 
library of gene mutants of ponB, coding for the PBP1b of E. coli (1). Ten 
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mutants of this gene were used. The gene sequence of each mutant differs 
from that of the native gene by a non homologous zone of thirteen to sixteen 
bases in length resulting from the substitution of five initial codons by five 
alanine codons according to the technique described by Lefevre et al. (8). 

The substitution carried by each mutant is characterized by the 
presence of a unique site of the restriction enzyme Pvu II surrounded by two 
Pst I enzyme sites, which permits the mutants to be distinguished from each 
other by their digestion profile with these restriction endonucleases. Figure 3 
represents the positions of the ten zones of mutations {Pvu II and Pst I) 
carried by each mutant. 

After PCR amplification of the genes of the ten mutants, the PCR 
products were purified, mixed in equimolar quantity in order to form the library. 
The polynucleotide sequences of this library were digested with the restriction 
enzymes Hinf I and Bsa I, in such a way as to generate libraries of restriction 
fragments. The restriction fragments were then incubated with various 
amounts of the wild type template, at different quantities, in the presence of a 
thermostable ligase. After several denaturation/hybridization/ligation cycles, a 
fraction of the reaction mixture was used to carry out a PCR amplification with 
a couple of primers specific to the 5' and 3' ends of the genes of the mutants 
and non-specific to the 5" and 3" ends of the wild type template. The 
amplification product was cloned and the obtained clones were analyzed for 
their digestion profile with the Pvu II or Pst I restriction endonucleases. The 
obtained profiles permitted the determination of which fragment(s) of the 
mutants had been able to be recombined with the others in order to form an 
entire gene. 

II MATERIALS 

1) Strains and plasmids 
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The strain MC1061 (F araD139, A (a ra-leu)7 696, galE15, galK16, A 
{lac)X74, rpsL (Str*), mcrA mcrB1, hsdR2 fr k " m/)) is derived from 
Escherichia coli K12. 



nit? 

v 

IV 

U! 

lij 

»p .1 T *t 

OS 



10 



15 



The vector pARAPONB stems from the vector pARA13 (3) in which the 
ponB gene carrying a thrombin-cutting site (9) was introduced between the 
restriction sites Nco I and Nar I. The vector pET26b+ is one of the pET 
vectors developed by Studier and Moffatt (13) and commercialized by 
NOVAGEN Corporation. 

2) Oligonucleotides 
The oligonucleotides were synthesized by ISOPRIM corporation 
(Toulouse). The oligonucleotide sequences are reported in Table I below. 



Table I 



u. 



Oligo N 


S-ACTCAGTAGCATOOCCOGGAATGACCGCGAGCC 3' 


Oligo E 


5U2CGCGGTGCACCCAi ft TT rTft ftTTftPTftPPAAAflATOTnC 


Oligo M1 


51GCCCCTGAATATTGGGGAGAAAAAOC 3' 


Oligo M2 


5' AGAAGGAGATGAAAAGAAAQQQTTAATATC-3' 


Oligo A1 




Oligo A2 


5' CCQCGGTGGAQCGAATTC 3' 



20 



31 



Reagents 



The restriction and modification enzymes cited in Table II below were 
used according to the recommendations of the suppliers. 
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Table II 
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Enzyme 


Concentration 


Supplier 


Ncol 


10 U/ul 


New England Biolabs 


Pstl 


20 U/ul 


New England Biolabs 


EcoRI 


20 u/ul 


New England Biolabs 


Bsa 1 


5 U/ul 

» 


New England Biolabs 


Hinf 1 


10 U/ul 


New England Biolabs 


Pvull 


1 0 U/jil 


New England Biolabs 


T4 DNA ligase 


400 U/|xl 


New England Biolabs 


Taq DNA polymerase 


5 U/ul 


PROMEGA 


AMPLIGASE 


1 00 U/ul 


EPICENTRE 



The buffers used are reported in Table III below 



Buffers 


Composition 




Tris HC1 10 mM, pH 8.0 


Polymerization 20X 


Tris HCI 100 mM pH 8.3, MgCI 2 15 mM, KCI 500 mM, 

1.0% TRITON X100® 


Restriction A 1 0X 


500 mM NaCI, 100 mM Tris HCI pH 7.9, 100mM 

MgCI 2 , 10mM DTT, 


Restriction B 1 0X 


1 M NaCI, 500 mM Tris HCI pH 7.9, 100 mM MgCI 2 , 

10 mM DTT 


Restriction C 1 0X 


500 mM NaCI, 1 M Tris HCI pH 7.5, 100 mM mM 
MgCI 2 , 0.25% TRITON X100® 


AMPLIGASE 10X 


200 mM Tris HCI pH 8.3, 250 mM KCI, 100 mM 
MgCI 2 , 5 mM NAD, 0.1% TRITON X100® 


Ligation 10X 


500 mM Tris HCI pH 7.5, 100 mM MgCI 2 , 100 mM 
DTT, 10 mM ATP, 250ug/ml BSA 



III PREPARATION OF THE TEMPLATE 
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The wild type ponS gene was amplified by a PCR reaction step by 
using as primers the oligonucleotides M1 and M2 (Fig. 4). Five PCR 
reactions were prepared by adding 50 ng of pPONBPBR plasmid carrying the 
wild type gene (7) to a mixture containing 10 ^il of polymerization buffer, 10 y\ 
of dNTPs 2mM, 20 pmol of each oligonucleotide M1 and M2, and 5U of Taq 
DNA polymerase, in a final volume of 100 These mixtures were incubated 
in Perkin-Elmer 9600 Thermocycler according to the following program: (94 
°C - 2 min.) - (94°C 15 sec. - 60°C 30 sec. -72°C 1 min.) x 29 cycles - (72°C 
- 3 min.). 



The product of the five PCR was mixed and loadedon a 1% 
TBEagarose gelAfter migration and staining of the gel with ethidium bromide, 
the band at 2651 bp, corresponding to the ponB gene amplification product 
surrounded by two fragments of 26 bp and 90 bp respectively, was visualized 
2 15 by trans-illumination under ultraviolet, and cut out with a scalpel in order to be 

purified with the QUIAquick system (QIAGEN). Allthe DNA thus purified was 
eluted in 120 \x\ of buffer T. The concentration of this DNA was 
approximatively 100 ng/pJ as measured by its absorbanceat 260 nm . 



U 20 IV PREPARATION OF THE LIBRARY 



1) Amplification of the mutant genes 



The genes of the ten mutants were separately amplified by a PCR 
25 reaction witholigonucleotides N and E. These oligonucleotides introduce 
respectively the restriction sites Nco I and Eco Rl, permitting the cloning of the 
products obtained with these two sites. 

Each PCR reaction was prepared by adding 50 ng of the plasmid 
30 carrying the mutant gene to a mixture containing 10 jil of polymerization 
buffer, 10 |al of dNTPs 2mM, 20 pmol of each oligonucleotide N and E, and 5U 
of Taq DNA polymerase, in a final volume of 100 \x\. This mixture was 
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incubated in a Perkin-Elmer 9600 thermocycler according to the following 
program: (94°C - 2 min.) - (94°C 15 sec. - 60°C 30 sec. - 72°C 1 min.) x 29 
cycles - (72°C - 3 min.). 

5 The specificity of the genetic amplification was verified by restriction 

profile with the Pvu II endonuclease, by incubating 5 pi of each PCR product 1 
hour at 37 °C in a mixture containing 3 pi of restriction buffer A and 5U of the 
Pvu II enzyme in a final volume of 30 pi. 15 pi of that digestion reaction were 
loaded on a TBE 1% agarose gel. After migration and staining with ethidium 

10 bromide, the gel was exposed to ultraviolet. The visualization of the 
restriction fragments permitted confirmation of the specificity of the genetic 
amplification of each mutant gene. 

In parallel, 3 pi of each PCR reaction were loaded on a TBE 1% 
15 agarose gel . After migration, the gel was treated as above. The intensity of 
each band permitted the assessment that the genetic amplifications had the 
same yield. 
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-4* 2 ) Creation of libraries of restriction fragments . 

20 

50 pi of each of the ten PCR were mixed and loaded on a 1% TBE agarose 
gel. After migration and staining with ethidium bromide, the band at 2572 bp, 
corresponding to the amplification product of the genes of the ten mutants, 
was cut out with a scalpel and purified with the Quiaquick system (QIAGEN). 
25 Allthe DNA thus purified was eluted in 120 pi of buffer T. The concentration of 
this DNA was approximately 100 ng/pl according to its absorbance at 260 nm. 

In order to generate the libraries of restriction fragments, 100 pJ of this 
DNA were incubated for one hour at 50°C in a mixture containing 12 pi of 
30 restriction buffer B, 1.2 pi of BSA (at 10 mg/ml), 25 U of the enzyme Bsa I and 
4 pi of water. Then, 2 pi of restriction buffer B, 2 pi of BSA (at 1 mg/ml), 50 U 
of the enzyme Hinf I and 1 1 .5 pi of water were added to the mixture, which 
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was incubated for one hour at 37 °C. The digestion mixture was purified on a 
QIAquick column (QIAGEN), and eluted with 30 pi of buffer T. 1 pi of this 
eluate was loaded on a 1% TBE agarose gel in order to verify that the 
digestion had been total, and that it had generated 6 restriction fragments, 
and consequently six libraries of fragments, of 590 bp, 500 bp, 472 bp, 438 
bp, 298 bp and 274 bp. The concentration of this DNA was approximately 
250 ng/pl according to its absorbance at 260 nm. 

V RLR (Recombinatinq Ligation Reaction). 

The RLR reaction (Recombining Ligation Reaction) was carried out by 
incubating determined quantities of restriction fragments Hinf I - Bsa I from 
the genes of ten mutants with the complete template (i.e., the wild type ponB 
gene), in the presence of a thermostable DNA ligase. The table IV below 
reports the composition of the mixtures for RLR. 



Table IV 





RLR 1 


RLR 2 


RLR 3 


RLR 4 


T- 


Fragments Hinf 1 - Bsa 1 
of ten mutants 
(100 ng/pl) 


0.5 pi 


1 pi 


2 pi 


5 pi 


5 pi 


Wild type ponB 
template 
(100 ng/pl) 


0.6 pi 


1.2 pi 


2.4 pi 


6 pi 


6 pi 


AMPLIGASE 10X Buffer 


2 pi 


2 pi 


2 pi 


2 pi 


2 pi 


AMPLIGASE (25 U/pl) 


1 pi 


1 pi 


1 pi 


1 pi 




H 2 0 




qsp 20 
pi 


qsp 20 
Ml 


qsp 20 
Ml 


qsp 20 
Ml 


qsp 20 
pi 
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The negative control is identical to the reaction of RLR4, but does not 
contain thermostable DNA ligase. These different mixtures were covered with 
a drop of mineral oil and incubated in a Perkin-Elmer 9600 thermocycler in 
200 pi microtubes according to the following program: ( 94 °C, 5 min.) - 
(94°C, 1 min. - 65°C, 4 min.) x 35 cycles. 

10 vi of each RLR reaction were then added to a PCR reaction mixture 
containing 10 pi of polymerization buffer, 10 pi of 2 mM dNTPs, 40 pmol of 
each oligonucleotide A1 and A2, and 5 U of Taq DNA polymerase in a final 
volume of 100 jjJ. This mixture was incubated in a Perkin-Elmer 9600 
thermocycler according to the following program: (94°C, 5 min.) - (94°C, 30 
sec. - 46°C, 30 sec. - 72°C, 1 min.) x 29 cycles - (72°C, 2 min.). This PCR 
reaction permitted specific amplification of the ligation products formed in the 
course of the RLR reaction, without amplifying the template, since the 
oligonucleotides A1 and A2 are not able to hybridize with the template (it), as 
shown in Figure 4. 

5 uJ of each RLR reaction and 10 of each of the previous PCR 
reactions were loaded on a 1% TBE agarose gel. After staining with ethidium 
bromide, the gel was exposed to ultraviolet light, as shown in Figure 5. 

The analysis of this gel reveals that only the reaction of RLR4 contains, 
as the negative control, restriction fragments still visible (tracks 4 and 5). 

The absence of PCR product for the negative control (track 10) reveals 
not only that the PCR reaction is specific (no amplification of the complete 
template), but also that the restriction fragments present in the mixture cannot 
be substituted for the primers to generate a contaminant PCR product under 
the chosen conditions. In parallel, the presence of a unique band at about 
2500 bp in tracks 6, 7 and 8 demonstrates that an RLR product was able to 
be amplified by PCR for the RLR1, 2 and 3 reactions. These three RLR 
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reactions therefore permitted the regeneration of one or more of the complete 
genes starting from six libraries of restriction fragments. 

VI ANALYSIS OF THE AMPLIFICATION PRODUCTS OF THE 
RLR REACTIONS 

i) Cloning 

The PCR amplification products of the RLR 1 , 2 and 3 reactions were 
purified with the Wizard PCR Preps system (PROMEGA) and eluted in 45 jil 
of buffer T. 6 \i\ of each purified PCR were incubated 1 hour at 37 °C in a 
mixture containing 3 ul of restriction buffer C, 3 ul of BSA (1 mg/ml), 20 U of 
the Eco Rl enzyme, 10 U of the Nco I enzyme and 15 ul of water. 

In parallel, two vectors (pARAPONB and pET26b+) were prepared for 
the cloning. These vectors were linearized by incubating 3 ug of these 
plasmids for 2 hours at 37 °C, in a mixture containing 3 ul of restriction buffer 
C, 3 ul of BSA (1 mg/ml), 20 U of the Eco Rl enzyme, 10 U of the Nco I 
enzyme and 19 ul of water. 

The linearized vectors as well as the digested PCR were purified on a 
TBE 1% agarose gel with the QIAquick system (QUIAGEN). Each vector or 
each digested PCR was eluted in 30 ul of buffer T. 

The ligation of each PCR digested with each of the vectors was carried 
out according to the conditions described in table V below, and incubated at 
16 °C for 16 hours. 



Table V 





Ligation with the vector 
pARAPONB 


Ligation with the vector pET26b+ 




LpAR1 


LpAR2 


LpAR3 


TLpAR 


Lpb 1 1 


Lpb 1 2 


Lpb I 3 


TLpET 
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PCR amplification 
RLR 1 digested 
Nco I - Eco Rl 
PCR amplification 
RLR 2 digested 
Nco I - Eco Rl 



4 |il 



PCR amplification 
RLR 3 digested 
Nco I - EcoRI 



4 |il 



Vector pARAPONB 
digested Nco I- 
Eco Rl 



1 \i\ 



Vector pET26b+ 
digested Nco I- 

Eco Rl 
Ligation Buffer 
Ligase 



2 |il 
12~|il 



2 jil 
Tjll 

12^1 



2 Hi 
12~^l 




2 |il 

i6~iii 



4 til 



1 nl 



2 nl 

1~2lll 



4 nl 



1 nl 



2 ^1 

i~mT 

12^1 



4 |il 



1 |il 



2 |il 
12~|il 



200 ii\ of chimiocompetent MC1061 cells (4) were transformed with 10 
of each ligation by a thermal shock (5), and the cells thus transformed were 
spread over a selection medium. 

No clone was obtained after transformation of ligation controls TLpAR 
and TLpET, thus indicating that the Nco I - Eco Rl vectors pARAPONB and 
pET26b+ cannot undergo an intramolecular ligation. 



1 |Ll 



16 |il 



2) Screening bv PCR 

A first screening of the clones obtained after transformation of the 
ligations with the vector pARAPONB was carried out by PCR. 42 colonies, 14 
from each ligation LpAR1 , LpAR2 and LpAR3 ( were resuspended individually 
in a PCR mixture containing 5 \i\ of polymerization buffer, 40 pmol of each 
oligonucleotide A1 and A2, 5 nl of 2 mM dNTPs and 5U of Taq DNA 
polymerase in a final volume of 50 pi. A negative control was obtained by 
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adding to the PCR mixture 50 ng of the plasmid pBR322 in place of the 
colony. These 43 tubes were incubated in a Perkin-Elmer 9600 thermocycler 
according to the following program: (94°C, 5 min.) - (94°C, 30 sec. - 46°C, 30 
sec. - 72°C, 1 min.) x 29 cycles - (72°C, 2 min.). 5 pi of each of these PCR 
reactions were then incubated for 1 hour at 37 °C in a mixture containing 2 pi 
of restriction buffer A, 2 pi of BSA (1 mg/ml) and 5 U of the restriction enzyme 
Pvu II in a final volume of 20 pi. 

10 pi of each of these digestions were loaded on a TBE 1% agarose 
gel in parallel with 5 pi of each non-digested PCR (thus avoiding possible 
confusion of non-specific bands of the PCR with a fragment obtained by 
restriction digestion). After migration and staining of this gel with ethidium 
bromide, the bands resulting from the digestion by the enzyme Pvu II were 
analyzed in order to determine which fragment(s) of initial mutants was/were 
associated with the others in order to reconstruct an entire gene. This 
screening reveals the presence of 27 genes carrying one mutation, 7 genes 
carrying two mutations and 8 genes no longer carrying any mutation. 

3) Screening by plasmidic Q NA minipreparation 

The second screening was effected by carrying out an extraction of the 
plasmidic DNA (5) from 21 clones resulting from the transformation of the 
ligations with the vector pET26b+ (7 clones of each ligation). 5 pi of the 
plasmidic DNA thus obtained for each clone were incubated for 1 hour at 37 
°C in a mixture containing 1 pi of restriction buffer C, 6 U of the enzyme Pst I, 
3 U of the enzyme Nco I and 6 U of the enzyme Eco Rl in a final volume of 10 
pi. 5 pi of each of these digestions were loaded on a TBE 1% agarose gel. 
After migration and staining of this gel with ethidium bromide, the bands 
resulting from the digestion by the Pst I enzyme were analyzed in order to 
determine which fragment(s) of the initial mutants had associated with the 
others in order to reconstruct an entire gene. This screening reveals the 
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presence of 13 genes carrying a mutation, 5 genes carrying two mutations 
and 3 genes no longer carrying a mutation. 

4) Statistical Analysis of the recombinations. 

According to the position of each mutation as compared to the cutting 
sites of the enzymes Hint I and Bsa I, as represented in Figure 6, it is possible 
to calculate the probability of obtaining in the course of the RLR reaction the 
creation of a gene carrying 0,1,2, 3, or 4 of the mutations of the initial genes. 

Thus, by considering that the RLR reaction is totally random the 
probabilities P are as follows: 



P(0 mutation) = 



P(l mutation) = 



P(2 mutations) = 



Wo) 



n = l 



= 30.24% 



— n 



10 — i 



10 J 



44.04% 



n = \ 



g (10- aY 10-(a + w) 



a J 



v a + n 



10. 



= 21.44% 



P(3 mutations) = X 



10-n 



n 



n lis 



= 4.04% 



P(4 mutations) = 



4 ( i \ 
J! — =0.24% 

ii vio^ 



The two screenings carried out give results close to these statistical 
predictions, as reported in table VI below, thus indicating that the RLR 
reaction is quasi-random. A slightly higher proportion of genes carrying one 
mutation, to the detriment of the genes carrying zero mutation, is observed. 
This phenomenon could be attributed to a weak toxicity of the ponB gene 
already observed and to the slight of expression leakage of vectors 
pARAPONB and pET26b+, which would favor the selection of genes carrying 
an inactivatingmutation. 
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Table IV 



% 


0 mutation 


1 mutation 


2 mutations 


3 mutations 


4 mutations 


Statistics 


30.24 


44.04 


21.44 


4.04 


0.24 


PCR 
Screening 


21 


63 


16 


0 


0 


Mini- 
preparation 
Screening 


14 


62 


24 


0 


0 
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